AITopics

2511.15357

Country: Europe > Sweden (0.30)

Genre: Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
Health & Medicine > Health Care Providers & Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.89)

arXiv.org Artificial IntelligenceSep-9-2025

Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge

Liang, Hao, Wu, Ruitao, Zeng, Bohan, Niu, Junbo, Zhang, Wentao, Dong, Bin

Multimodal reasoning remains a fundamental challenge in artificial intelligence. Despite substantial advances in text-based reasoning, even state-of-the-art models such as GPT-o3 struggle to maintain strong performance in multimodal scenarios. To address this gap, we introduce a caption-assisted reasoning framework that effectively bridges visual and textual modalities. Our approach achieved 1st place in the ICML 2025 AI for Math Workshop \& Challenge 2: SeePhys, highlighting its effectiveness and robustness. Furthermore, we validate its generalization on the MathVerse benchmark for geometric reasoning, demonstrating the versatility of our method. Our code is publicly available at https://github.com/OpenDCAI/SciReasoner.

large language model, machine learning, natural language, (18 more...)

2509.06079

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.70)
(2 more...)

Hussain, Aftab, Rabin, Md Rafiqul Islam, Ahmed, Toufique, Xu, Bowen, Devanbu, Premkumar, Alipour, Mohammad Amin

Trojans in Large Language Models of Code: A Critical Review through a Trigger-Based Taxonomy

arXiv.org Artificial IntelligenceMay-5-2024

Large language models (LLMs) have provided a lot of exciting new capabilities in software development. However, the opaque nature of these models makes them difficult to reason about and inspect. Their opacity gives rise to potential security risks, as adversaries can train and deploy compromised models to disrupt the software development process in the victims' organization. This work presents an overview of the current state-of-the-art trojan attacks on large language models of code, with a focus on triggers -- the main design point of trojans -- with the aid of a novel unifying trigger taxonomy framework. We also aim to provide a uniform definition of the fundamental concepts in the area of trojans in Code LLMs. Finally, we draw implications of findings on how code models learn on trigger design.

critical review, language model, prediction, (14 more...)

2405.02828

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > Canada > Ontario > Toronto (0.04)
(5 more...)

Genre:

Research Report (0.82)
Overview (0.68)

Industry: Information Technology > Security & Privacy (0.94)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Yfantidou, Sofia, Constantinides, Marios, Spathis, Dimitris, Vakali, Athena, Quercia, Daniele, Kawsar, Fahim

Beyond Accuracy: A Critical Review of Fairness in Machine Learning for Mobile and Wearable Computing

arXiv.org Artificial IntelligenceSep-22-2023

The field of mobile and wearable computing is undergoing a revolutionary integration of machine learning. Devices can now diagnose diseases, predict heart irregularities, and unlock the full potential of human cognition. However, the underlying algorithms powering these predictions are not immune to biases with respect to sensitive attributes (e.g., gender, race), leading to discriminatory outcomes. The goal of this work is to explore the extent to which the mobile and wearable computing community has adopted ways of reporting information about datasets and models to surface and, eventually, counter biases. Our systematic review of papers published in the Proceedings of the ACM Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) journal from 2018-2022 indicates that, while there has been progress made on algorithmic fairness, there is still ample room for growth. Our findings show that only a small portion (5%) of published papers adheres to modern fairness reporting, while the overwhelming majority thereof focuses on accuracy or error metrics. To generalize these results across venues of similar scope, we analyzed recent proceedings of ACM MobiCom, MobiSys, and SenSys, IEEE Pervasive, and IEEE Transactions on Mobile Computing Computing, and found no deviation from our primary result. In light of these findings, our work provides practical guidelines for the design and development of mobile and wearable technologies that not only strive for accuracy but also fairness.

acm interact, fairness, wearable ubiquitous technol, (11 more...)

2303.15585

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.28)
Europe > Greece > Central Macedonia > Thessaloniki (0.04)
North America > United States > New York > New York County > New York City (0.04)
(12 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Information Technology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
(5 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.46)
(2 more...)

#artificialintelligenceSep-11-2022, 02:35:15 GMT

A critical review of the EU's 'Ethics Guidelines for Trustworthy AI'

Europe has some of the most progressive, human-centric artificial intelligence governance policies in the world. Compared to the heavy-handed government oversight in China or the Wild West-style anything goes approach in the US, the EU's strategy is designed to stoke academic and corporate innovation while also protecting private citizens from harm and overreach. In 2018, the European Commission began its European AI Alliance initiative. The alliance exists so that various stakeholders can weigh-in and be heard as the EU considers its ongoing policies governing the development and deployment of AI technologies. Since 2018, more than 6,000 stakeholders have participated in the dialogue through various venues, including online forums and in-person events.

ai system, guideline, stakeholder, (14 more...)

Country:

Europe (0.55)
Asia > China (0.25)
North America > United States > California (0.05)

Industry: Information Technology > Security & Privacy (0.49)

Technology:

Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.54)
Information Technology > Artificial Intelligence > Applied AI (0.35)

#artificialintelligenceMay-22-2022, 12:48:10 GMT

Why are we failing at the ethics of AI? A critical review

Anja Kaspersen and Wendell Wallach are senior fellows at Carnegie Council for Ethics in International Affairs. In November 2021, they published an article that changed the AI ethics conversation: Why Are We Failing at the Ethics of AI? Six months later, the questions the article raised are no closer to resolution. This article was a don't-hold-your-punches review on the state of AI ethics, with which I am in almost complete agreement. If we want to advance the AI conversation, this is still a good place to start. I've quoted a portion of their article, with my comments interspersed: While it is clear that AI systems offer opportunities across various areas of life, what amounts to a responsible perspective on their ethics and governance is yet to be realized.

ai system, critical review, ethics, (10 more...)

Country:

North America > United States (0.05)
Europe > United Kingdom > England (0.05)

Genre: Personal (0.48)

Industry: Law (0.69)

Technology: Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

#artificialintelligenceFeb-19-2022, 18:25:23 GMT

Tabular Learning -- Gradient Boosting vs Deep Learning( Critical Review)

Review of Deep Learning based models such as DeepInsight, IGTD, SuperTML, NODE, DeepFM, TabNet, Tab-Transformer, AutoInt, FT-Transformer on Tabular data.

architecture, learning, tabular data, (13 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceMay-11-2021

A Critical Review of Information Bottleneck Theory and its Applications to Deep Learning

Alomrani, Mohammad Ali

In the past decade, deep neural networks have seen unparalleled improvements that continue to impact every aspect of today's society. With the development of high performance GPUs and the availability of vast amounts of data, learning capabilities of ML systems have skyrocketed, going from classifying digits in a picture to beating world-champions in games with super-human performance. However, even as ML models continue to achieve new frontiers, their practical success has been hindered by the lack of a deep theoretical understanding of their inner workings. Fortunately, a known information-theoretic method called the information bottleneck theory has emerged as a promising approach to better understand the learning dynamics of neural networks. In principle, IB theory models learning as a trade-off between the compression of the data and the retainment of information. The goal of this survey is to provide a comprehensive review of IB theory covering it's information theoretic roots and the recently proposed applications to understand deep learning models.

critical review, deep learning, information bottleneck theory, (1 more...)

2105.04405

Genre:

Overview (0.87)
Research Report (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceDec-1-2020

A critical review on computer vision and artificial intelligence in food industry

Food demand and sustainability to feed the growing population are explained clearly. The technological innovations including 4.0 industry revolution strengthen the agricultural sector. The usage of computer vision and artificial intelligence in the field of agriculture and food industry is deeply elaborated. Emerging technologies such as computer vision and Artificial Intelligence (AI) are estimated to leverage the accessibility of big data for active training and yielding operational real time smart machines and predictable models. This phenomenon of applying vision and learning methods for the improvement of food industry is termed as computer vision and AI driven food industry.

computer vision, computer vision and artificial intelligence, food industry, (5 more...)

Genre: Overview (0.30)

Industry:

Food & Agriculture > Agriculture (1.00)
Consumer Products & Services > Food, Beverage, Tobacco & Cannabis (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.86)

#artificialintelligenceMay-5-2020, 00:23:08 GMT

A critical review of Star Wars AI

This article has spoilers for just about the entire Star Wars universe. When it comes to fictional portrayals of artificial intelligence technology, the Star Trek universe stands head and shoulders above all others. Series creator Gene Rodenberry's vision for the far future seems just as prescient today, in the era of advanced deep learning, as it did in the 1960s when he unveiled it. Unfortunately, this article is about the AI in Star Wars. Before I go off the rails, I should point out that I'm a light saber-wielding Star Wars fanatic.

droid, star war, star war universe, (9 more...)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology: Information Technology > Artificial Intelligence (1.00)